Robust Hands-free Human-Robot Communication in Reverberant Environments
نویسندگان
چکیده
Speech-based human-robot interaction is often plagued with issues such as reverberation and changes in speaker position that impacts overall performance. In this paper, we show a method in compensating the joint effects of reverberation and the change in speaker position. The acoustic perturbation caused by these two takes its toll on the Automatic Speech Recognition (ASR) and then the Spoken Language Understanding (SLU). Consequently, these will lead to a failure in the human-robot interaction experience. The proposed method is specifically designed to address the challenging environment condition in which robots are deployed. First, we analyze the impact of reverberation in the form of temporal smearing per change in speaker position. Then, we extract the smearing coefficients that capture the joint dynamics between the speech signal at current position and the room acoustics as observed by the robot. These coefficients are utilized to update the room transfer function (RTF) and the suppression parameters are stored offline. Moreover, all of these processes are optimized in the context of the ASR system for robot application. In the online mode, the reverberant data at an arbitrary position is processed using the parameters pre-computed offline. This effectively compensates the joint effects of reverberation at the arbitrary speaker position. Experimental results using real data gathered in a human-robot communication setting show that the proposed method outperforms existing methods.
منابع مشابه
Extraction of Robust Features by Combining Noise Reduction and Fdlp for the Recognition of Noisy Speech Signals in Hands-free Mode
A processing scheme is presented to create robust acoustic features for the recognition of noisy speech signals at a hands-free speech input in reverberant environments. The robustness is achieved by extending a Mel cepstral analysis scheme by the additional processing steps of a noise reduction and a frequency domain linear prediction (FDLP). FDLP is a signal processing technique to modify the...
متن کاملSystem Request Utterance Detection Based on Acoustic and Linguistic Features
Robots are now being designed to become a part of the lives of ordinary people in social and home environments, such as a service robot at the office, or a robot serving people at a party (H. G. Okuno, et al., 2002 ) (J. Miura, et al., 2003). One of the key issues for practical use is the development of technologies that allow for user-friendly interfaces. This is because many robots that will ...
متن کاملDiscrete time robust control of robot manipulators in the task space using adaptive fuzzy estimator
This paper presents a discrete-time robust control for electrically driven robot manipulators in the task space. A novel discrete-time model-free control law is proposed by employing an adaptive fuzzy estimator for the compensation of the uncertainty including model uncertainty, external disturbances and discretization error. Parameters of the fuzzy estimator are adapted to minimize the estimat...
متن کاملRobust Hybrid Motion Force Control Algorithm for Robot Manipulators
In this paper we present a robust hybrid motion/force controller for rigid robot manipulators. The main contribution of this paper is that the proposed hybrid control system is able to accomplish motion objectives in free directions and force objectives in constrained directions under parametric uncertainty both in robot dynamics and stiffness constraint constant. Also, the given scheme is prov...
متن کاملA new approach for the adaptation of HMMs to reverberation and background noise
Looking at practical application scenarios of speech recognition systems several distortion effects exist that have a major influence on the speech signal and can considerably deteriorate the recognition performance. So far, mainly the influence of stationary background noise and of unknown frequency characteristics has been studied. A further distortion effect is the hands-free speech input in...
متن کامل